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Abstract 

In the bus network problem, the goal is to generate a plan for getting from point 
X to point Y within a city using buses in the smallest expected time. Because bus 
arrival times are not determined by a fixed schedule but instead may be random, 
the problem requires more than standard shortest path techniques. In recent work. 
Datar and Ranade provide algorithms in the case where bus arrivals are assumed to 
be independent and exponentially distributed. 

We offer solutions to two important generalizations of the problem, answering 
open questions posed by Datar and Ranade. First, we provide a polynomial time 
algorithm for a much wider class of arrival distributions, namely those with mrr^r/*ur; 
failure rate. This class includes not only exponential distributions but also uniform, 
normal, and gamma distributions. Second, in the case where bus arrival times are 
independent and geometric discrete random variables, we provide an algorithm for 
transportation networks of buses and trains, where trains run according to a fixed 
schedule. 


’Supported in part by a grant from the Alfred P. Sloan Foundation and an equipment grant from 
Compaq Computer Corporation. 



1 Introduction 


Imagine I rving to t ravel across a cil v by bus. with I lie goal of minimizing the exported total 
travel time. There may be several dillerent, possible routes, with some requiring (‘hanging 
buses. If buses followed a lixed schedule, then standard shortest-pat h techniques would be 
sufficient to find the best travel plan. However, bus arrivals rarely lollow a fixed schedule 
(even when they are supposed to). Bus arrivals are more naturally modeled as a random 
process, in which case a natural goal is to develop a plan that minimizes the total expected 
travel time. Although this bus network problem appears specific, it is representative of 
a wide class of scheduling problems where an appropriate plan must be developed with 
incomplete information modeled probabilistically. 

The bus network problem was recently examined by Datar and Ranade. in the case 
where arrival distributions are independent Poisson processes, i.e. the interarrival times are 
exponentially distributed, with the mean of each distribution fixed for all time. Their results 
are based on the key insight that in this case, the optimal plan is composed of statements 

of the following form: “When at station i. wait for one of buses X ll: A ' l2 A',*,; take the 

first of these buses that arrives.” Moreover, they show that because optimal plans have such 
a simple form, they can be calculated in polynomial time using a dynamic programming 
algorithm. 

On reflection it is clear that the simple form of the optimal plan is highly dependent 
on the assumption of independent Poisson arrival processes with fixed means. (We will see 
examples below.) This assumption is problematic: indeed, the authors admit. "Perhaps 
the most unfounded assumption in our model is that of Poisson arrivals of the buses.” 

As our first result, we show that an optimal plan has only a slightly more complex 
form when the arrival distributions for buses are assumed to be independent and have 
increasing failure rate. Intuitively, the waiting time for a bus has increasing failure rate if 
the luiigci you wait, the more likely ..:e bus is about to arrive. Many natural models - 
including uniform, normal, and gamma distributions — have increasing failure rate, so our 
result may be much more appropriate for real-world data. We describe how the optimal 
schedule in this case can be determined in polynomial time, assuming that we can compute 
with the relevant probability distributions in an effective manner. Finally, we demonstrate 
that assuming a slightly weaker property than increasing failure rate for the bus arrival 
distributions is insufficient for our results. 

As our second result, we partially answer another open question posed by Datar and 
Ranade: how can we handle both buses and trains in our transportation network? Here 
we use the term trains to represent transportation running on a fixed schedule, as opposed 
to buses which arrive according to a random distribution . 1 W r e demonstrate how to solve 
this problem in the case where time is discretized and interarrival times for buses are given 
by discrete geometric random variables. Note that discrete geometric random variables 

1 We acknowledge that our use of the terms buses and trains may be inaccurate for practice; still, they 
are useful. 
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pmvidr natural approximations lor continuous cxpoiicni ial random variables. where the 
accuracy o! the approximation depends on (lie granularity of the time interval for tin* 
discretization. Hence our result can he used to approximate the continuous Poisson arrival 
case. Although our solution is polynomial in tin* number of time steps modeled, we believe 
it may he ellective lor problems ol a reasonable si/e, 

1.1 Related work 

The earliest reference we have found to bus network problems is by Hall [4j. The starting 
point ol our work is the recent paper by Datar and Ranade. who solved the problem of 
bus transportation networks when all bus arrivals are independent and Poisson [3]. An 
interesting aspect ol this work is the on-line decision making process of the traveler, who 
chooses whether or not to take a bus as it arrives. Previous approaches required schedules 
that force the rider to commit to a single transit choice upon arriving at a stop, rather 
than flexibly choosing based on what bus gets there first [4. 9]. 

We also view this work as an interesting connection between algorithmic analysis and 
Markov decision processes. For more background on Markov decision processes, see for 
example [1, 6]; we offer a brief description here. In a Markov decision process, there is an 
underlying Markov process with associated actions and rewards. Bv choosing an action 
at a state, one affects the progress of the Markov process: the goal is to choose options 
that optimize the cumulative reward. In the case of the bus network problem, the actions 
in each state are whether or not to take a bus when it arrives, and the function we wish 
to optimize is the expected time to the destination. In the cases we consider here, the 
state also explicitly includes a time component, and hence it fits into the framework of 
time-dependent Markov decision processes introduced in [2j. Our work demonstrates that 
undei ceitain probabilistic assumptions, there are efficient algorithms to determine the 
actions that yield an optimal solution in the bus network setting. Our algorithms all ;eiv 
on dynamic programming, which is the fundamental technique for solving problems based 
on Markov decision processes [I. Cj. 

2 Buses with IFR Waiting Times 

2.1 Probability Preliminaries 

For completeness we cover basic definitions and properties of distributions we will use 
throughout the paper. Further information can be found in texts such as [7] or [8]. 

We will generally assume throughout that our random variables are non-negative with 
absolutely continuous cumulative distribution functions** and finite means, although our 
results can be modified to handle other cases, including lor example discrete distributions. 

"For our purposes, absolutely continuous means that the first derivative exists almost every where. 
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Fur a mumogal ivc random variable A with rumiilalive distribution function /' (/). we 
define I hi 1 survival ImieUon to la 1 F(t) = l - F(t.). Formally. A is said In have uirreusiny 
failure rut('(oi\)v IFR) iflogF(/) is ronravr on tin 1 support, of F. Thai is, F(t) is Inyconcave. 
Alternatively. if /(/) = F'(/) is the corresponding density function. I ho failure rati* is 
r(t) — J(t)/F(t). Tlu* condition that IogF(/.) is concave is equivalent, to the condit ion that 
r is increasing. 1 Tlie function r{f) satisties 

/n /(/) Pr(t<X<l + ±t\X>t) 

rlt) — -- — — lim . 

F{1) o Xt 

Informally. if A' represents a time spent waiting for a bus and X has increasing failure rate, 
it means the probability of the bus suddenly appearing increases the longer we wait. 

Similarly, A” has decreasiruj failure rate (or is DFR) if log F is convex on its support, or 
equivalently. X is DFR if r(t) is decreasing. 

Tin* mean residual life of X at time t is defined as 

m x (t) = E[ X - t | A* > t}. 

For example, if X represents the time until a bus arrives, the mean residual life m x (t) 
represents the average time until the bus arrives, given that it has not arrived during the 
first t units of time. Note that ni\(t) is defined to be 0 where F(t) = 0. The random 
variable X is said to have decreasing mean residual life or be DMRL if rn x (t) is decreasing. 
An interesting lemma left to the reader is that if A' is IFR then it is DMRL. but the reverse 
need not hold. 

The exponential distribution is both IFR and DFR. L’niform distributions are clearly 
IFR. Normal distributions can be shown to be IFR [5], as can gamma distributions with 
certain parameters [7]. In particular, any gamma random variable that is the sum of a 
finite number of exponential random variables is IFR. 


2.2 Form and Computation of the Optimal Schedule 

We begin with a theorem that shows the form of the optimal schedule when the waiting 
times for buses are IFR. 

Let T(s. d . h) denote the expected time to reach d from s using at most fz bus changes. 
Similarly, let 7)>(s. d . h) be the expected time to reach d from s using at most h bus changes, 
given that the rider gets on bus b now. 

We will focus on a single stop s with buses B\, B 2 < . . . , B^ stopping there. (For conve- 
nience. we do not include s in the variable description of the buses, but leave it implicit.) 
We will also use T x as an implicit shorthand for T^ t (s,d. h — 1). We let \\\ be the ran- 
dom variable representing the waiting time for bus i. and let \V t (t) be the random variable 
corresponding to the remaining waiting time. [\V t — t | \V t > /]. 

3 Here we follow the perhaps unfortunate but apparently standard practice and use “increasing** to mean 
'‘non-decreasing” and “decreasing” to mean “non-increasing” throughout. So IFR really means the failure 
rate is non-decreasing, even though IFR is the standard term. 
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[line me a lew additional concerns we mention line, [fa bus travels through multiple 
stops. we must assume that, the arrival distributions of buses at. each stop and the (.ravel 
Mines from stop to slop are independent. With this framework, we may assume without, 
loss ol generality that each bus travels only to a single next, stop: our results below can 
be modified so t he rider chooses t he best, of several possible stops along t in' route if t here 
me several stops. We will make this assumption in the theorem below. Second, suppose 
a bus visits the stop s but the rider chooses not to take it. It is not clear what arrival 
distribution we should use for the next, visit by a bus B t . The distribution ll\ represents 
the waiting time from our arrival; it is not dear that we should use the same distribution 
alter B, itsell arrives. Theorem 1 actually holds under any distribution for the waiting time 
of a “re-visit” by a bus B,. 


Theorem 1 Suppose that at every bus stop, the waiting times for the buses are indepen- 
dent random variables with increasing failure rate. Let B X .B 2 B k be the buses passing 

through a stop s. sorted in order of increasing T, (expected total remaining travel time to the 
destination d using at most h - 1 further bus changes ). Then the optimal travel plan from 
s to d using at most h bus changes has the following form: take B x whenever it arrives: 
take B 2 if it arrives before time t 2 ; take B 3 if it arrives before time t\; and so on, where 
the f* are decreasing (oo > t 2 > t 3 > ■ ■ ■ > 0). 

Proof: 


We first provide the important intuition. It is clear that in the optimal schedule, bus 
B { is taken whenever it arrives, since the expected time to reach d by taking any other bus 
must be at least as great as T\. 



T 2 >T l+ E{\V x U)b (1) 

then it. is better to wait lor bus B x . (Note that we have used in equation (1) that the 
waiting time for bus B x is independent of the arrival of bus B 2 .) The reverse is less clear: 
even if T 2 < T x + perhaps it could be better on average to wait for a following 

bus. hoping that it is B x but settling for B 3 or £?, if we are unlucky. In fact this is not the 
case; we will show that the condition 

T 2 <T x + E[W x {t)) (2) 

is sufficient as well as necessary for taking bus 2 at time t. Using this equivalence, and the 
(act that EfU i(f)] is decreasing in t (since IV x is assumed IFR). we can conclude that there 
is a threshold time t 2 such that the rider should take bus B 2 if it arrives before t 2 . where 

t 2 = inf{i :T 2 <T x + £[[[;(/)]}. 
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Nolo i hat at times where there is equality in I In* above expression. nit her unit mg or t aking 
the hns yields the same expected time, and hence without loss of generality we may say 
that the optimal schedule takes D> if and only if it arrives before /T The argument for 
other l)iis(*s will be similar, using induction on t he B 

To show that condition (2) is sullicient seems dillicult. since ostensibly we need to 
consider all possible other plans and arrival patterns of buses. We avoid i his complexity by 
introducing an option anjnmml. Let us suppose that when bus 2 arrives, we give the rider 
an option to force bus B> to wait: the rider can then board B> and have it leave at his or 
her discretion, or board another bus that arrives later. It is clear that this added option 
only helps the ruler. Moreover, for any plan in the original setting when' the rider waits for 
some other bus B x with i > 2 and boards that bus. there is a plan at least as good in the 
option setting where the rider exercises the option and takes bus Bo at the time it would 
have taken the other bus. Hence we need only consider whether the rider should take Bo 
now. exercise the option (taking Bo in the future), or wait for bus B { . 

In this context, however, choosing to take bus B 2 in the future can never be optimal. 
This follows again from the fact that E[\\\ (f)] is decreasing, so the longer B 2 sits idle, the 
more appealing B x becomes. Therefore, the only two potentially optimal choices are to 
board and take Bo immediately, or to commit to waiting for B\. This decision is precisely 
the test of Equation 2. resulting in the simple outcome that Bo should be taken if and only 
if it arrives before time t* z . 

Now let us consider the similar inductive argument for Bj. where j > 2. Let Z m be 
the random variable representing the time to reach d using at most h bus changes, if the 
rider waits for one of buses B\, Bo . .... B m and uses the optimal policy for these m buses. 
Similarly, let Z m (t) be the time to reach d after having already waited t seconds at <s. We 
know the form of the optimal policy on j — 1 buses via the inductive hypothesis. Clearly 
it is necessary tfmt 


Tj < E[Zj^(t)) (3) 

lor it to be optimal for the rider to take bus j if it arrives at time t. To show that (3) is 
also sufficient, it suffices to show that Zj„ i is DMRL from the option argument. 

We use the fact that the distribution of the \\\ are IFR to show that is DMRL. Un- 
fortunately. a direct argument is somewhat difficult, as a natural expression tor E[Zj-\(t)\ 
is difficult to write: the buses involved with the calculation of Zj-i(t) change with t. (The 
correct expression is therefore a sum. split according to the condition of when the first 
relevant bus arrives.) 

We instead show that Z^\ is DMRL over successive intervals. Inductively, it suffices 
to consider the interval [0,^.j]. The argument is simplified by constructing a new random 
variable which is similar to x except for the following changes. First, we replace the 
waiting time distribution for bus j — I by a distribution that is equal to Wj-\ for all t < t*_ { 
and is with all remaining probability. That is. for the variable we assume that bus 
j — 1 arrives at time t*_ { if it has not otherwise arrived. Note that £*[1 j _ i (t )] = E{Zj_i(t)} 
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ovi'i t li«' intrt v a I jl). I j | j. as I his change dors nul. allerl, l Ik* experl < •« I I ravel i mie over this 
iuteivul. Sec oikI. Ini )j_ | vve assmiK* dial it (he I > us B, is boarded. I lie remaining travel 
I ime is exactly t lie expectation T, insl ead o| a random variable. Again. wit h this change vve 
still have £[V_ / _t(/)| = E[Zj_[(t.)\ over the interval [0. /J_J (by linearity of ex|)ectat ions). 
Hence it, sullices to show that is IFR. to prove Z,_, is DMR.L. 

Note that 


Pr 0j-i > x) = Pr(Uj_, > x - r,_,) • Pr(Uj_ a > - Tj_>) ■ . . . ■ Pr(H', > x - T t ). 

But the survival functions of every term in the product on the right hand side are logconcave 
in x. since the R, are IFR. Hence the lelt hand side is logconcave in x. and since the left 
hand side is the survival function of }j_ r vve have that Y J _ l is TFR. Hence inductively Z,_, 
is D.MRL and the optimal policy has the form given in the statement of the theorem. 

Finally, note that t’ < since the Z,(t) are decreasing in i and the T, are increasing 
in i. □ 


Theorem 1 immediately provides an '‘elementary ” proof of the main result bv Datar 
and Ranade (Lemma 3.1 of [3]). 

Corollary 1 When arrivals for all buses are Poisson , then the optimal schedule has the 
following form: take one of buses B\. Bo* .... Bj as soon as it arrives. 

Proof: In this case, the H',(f) are independent of t. so in the proof of Theorem 1 vve must 

have that the t* are all infinity or 0. □ 

In the case of Poisson arrivals. Datar and Ranade show that the optimal schedule and 
the resulting expected travel times can be computed exactly efficiently [3]. Theorem 1 also 
■suggests a natural wav of computing an optimal schedule for our more general setting. Let 
Q be the maximum number of buses that pass through a station and 5 be the number 
"I sl.i-t<*ns. 'Ac m;r compute optim.l plans involving at most h bus changes indnctivelv. 
This first involves sorting the buses at each station according to the time to reach the 
destination using h 1 further bus changes. Then we compute successive values of t' for 
each stop. 

For distributions more complex than the exponential, computing the t * is non-trivial. It 

requires computing the expected time to reach the destination using buses B x . B 2 B t . x . 

which may require multiple integrations over the corresponding distributions (to find the 
distribution of the time the first of these buses that the rider will take arrives and the 
corresponding probability for each bus; note the time the bus arrives and which bus it 
is are correlated in our case!). In practice we expect computing the <* would be done 
numerically to suitably high precision, or possibly even by Monte Carlo simulation. We 
believe that the numerical analysis issues are outside the scope of this paper. Hence we 
simply assume the existence of a “black box” calculator for computing the <*. Given this, 
vve have the following corollary: 

Corollary 2 The optimal travel plan wider the conditions of Theorem l can be computed 
in polynomial time, assuming a black box for computing the t’ . 
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/You/’. The 1 1 »t al work from tin* sorting is (){hSQ iog(^). ami lliuv will In* O(hSQ) rom* 
pulatioiis using I hr black box. □ 

2.3 Are IFR distributions necessary? 

One might ask if IFR dist ributions arc required for the result o( Theorem l. The proof itsell 
requires only that the time to reach tin* destination if the rider waits for lower numbered 
buses is DMRL; other sets of distributions may have this property without being IFR. One 
natural suggestion is that perhaps if the IT, are simply DMRL then this is sufficient. We 
show, however, that Theorem 1 does not hold under this condition. 

Theorem 2 The result# of Theorem l fail to hold if the II, are only DMRL. 

Proof: We construct a counterexample. Let A be uniform over the range [0.2] U [4. 12]. 

It is simple to check that A' is DMRL. Now suppose there are two types of buses traveling 
from point a to point b. each with waiting distribution AY Fast buses have a constant 
travel time of 1: slow buses have a constant travel time of 2. Clearly if a fast bus come the 
rider should always take it. Intuitively, however, if there are enough fast buses, the rider 
should not take a slow bus that arrives early, because it is likely that a fast bus will shortly 
come. Once the rider has waited almost two time units, however, he should take a slow 
bus if it comes, since no buses arrive in the interval [2.4] and otherwise he is likely to end 
up waiting substantially before a fast bus appears. A calculation shows that if there is one 
slow bus and twelve fast buses, we should take a slow bus only if it arrives in the interval 
[0.558.2]. □ 

The counterexample of Theorem 2 can be modified in various ways. For example, we 
ran change the distribution V so that i r s support is a closed interval by adding a sp'iall 
weight over the interval [2.4], Also, by considering distributions A that consist of more 
disjoint intervals, we can construct examples where the proper times to take the slow bus 
consist of two or more disjoint intervals. The point behind Theorem 2 is that even though 
each fast bus has decreasing mean time to live, the random variable for the time until the 
first fast bus arrives does not. That is. the family of random variables with decreasing 
mean time to live is not closed under minimization, while IFR random variables ait. 

An interesting open question this counterexample raises is whether there is a natural 
wav to relate the complexity of the waiting time distributions and the complexity of the 
form of the optimal schedule. 

3 Buses and Trains 

We now consider another issue suggested in the conclusion of [3], and also examined in 
[2]: networks with mixed forms of transportation, such as buses and trains. Recall that in 
our model buses have an associated random waiting time distribution and an associated 
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Trains leave at quarter past, 
quarter until the hour 


Figure 1: A basic bus and train network. Arrivals of buses from A to B and from A to C are 
each Poisson with an average wait ol ten minutes. Travel times are constant. Trains from 
B leave on the hour and the hall hour! trains Irom C leave fifteen and forty-five minutes 
into the hour. 

random travel time distribution. We shall use the term train to refer (metaphorically) 
to transportation that arrives and departs at fixed absolute times. For example, consider 
Figure 1. From station A. the rider may catch a bus to either station B or station C. We 
assume the travel time from A to B is a constant five minutes and the travel time from A 
to C is a constant ten minutes. Arrivals of buses that travel from A to B are a Poisson 
process, with an average waiting time of ten minutes: the same holds for buses from A to 
C. At both stations B and C there are trains that run to station D. with the travel time on 
the train being one hour. Trains from B leave on the hour and the half hour: trains from 
C* leave and forty-fir** minutes into the hour 

This simple example highlights that introducing trains leads to substantial difficulties. 
Of primary importance is the introduction of absolute time: we are not only concerned with 
how long the rider has spent at the bus station, as in the problem with only buses, but 
the actual time until the trains depart. The expected time to reach the destination D from 
train stations B and C is not constant, as it was in the pure-bus setting of Section 2. but 
depends on the time the rider arrives at D. Because of this, the ideas behind Theorem 1 
no longer directly apply. In particular, there are times where the rider should pass up the 
bus to B in order to wait for the bus to C. and other times when the rider should do the 
opposite. For example, if a bus to station B arrives at station A just four minutes before 
the hour, we know that taking the bus will cause us to wait at station B. The rider is 
better off waiting for a bus to station C. and possibly catching a bus to station B later if 
necessary. 

In this section, we present an approach for handling mixed networks of buses and trains 
in the case where bus arrival times are discrete geometric random variables, which can be 
used to approximate the case of the continuous Poisson arrival process. Our method will 
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I m» to set up (lit* problem as a large dynamic pro^ra m tiling problem. or ^<|iiival<ait I v. as 
a Markov decision process. (Dynamic programming is a standard technique for Markov 
decision processes: again. see [1, G].) We first present our approach via the example above, 
and then discuss the general framework for larger problems. 

We first clarify here why we limit the l>us arrival processes to be discrete geometric 
random random variables. From our example, wo can set' that it is possible in mixed bus 
and train networks that the rider chooses not to take a bus at some time, only to take a 
bus on the same route later. If the bus arrival process is a geometric random variable, the 
fact that a bus has previously arrived need not be recorded in the state space: we may 
forget that the bus has arrived, as it does not affect the arrival ol future buses. (This is the 
memory lessness property of geometric and exponential random variables.) II. however, a 
bus has a more complicated arrival process, then the last time a bus on that route arrived 
may be relevant information for determining the arrival ol the next bus on that route. 
Keeping track of such information as the last arrival ol each bus would lead to a more 
complex, higher-dimensional state. Although handling such a state is theoretically leasible 
using the techniques we suggest, we do not address this issue here. 

For our problem, the state space will be pairs (s,t), where $ is a station and t is the 
current absolute time. To be at the state (s, t) denotes that the rider is still waiting at 
station s at time t. In order to make the underlying state space countable, we must assume 
time is discretized. Moreover, for the state space to be finite, we must also assume an 
ending time for the process. For example, we may assume that the buses and trains start 
running at noon and stop running at midnight, at which point one must call a friend for a 
rifle. To penalize this action, we make the cost associated with it very high but finite (such 
as two hours). 

Discrete geometric random variables can naturally be used to approximate continuous 
exponentially distributed random variables: the error in the approximation depends on the 
granularity of the discrete time scale. Hence this approach can be used to approximate 
behavior when bus arrival processes are Poisson. On the other hand, the number of states 
required is proportional to the number of discrete time steps being modeled. 

Finally, for convenience we will assume here that each bus travels from our current stop 
to a unique other stop as opposed to multiple stops. The case where buses have multiple 
stops can be handled in an similar fashion (with a possible increase in the size ot the state 
space). 

3.1 The Dynamic Program 

We first consider our example. For convenience let us assume that time is discretized in 
minutes, and buses leave on the minute. Hence for example the rider may begin at state 
(A. 11:59 am), and if a bus arrives in the intervening minute, he may get on the bus and 
leave station A at 12:00. Of course the rider may choose not to get on the bus, in which 
case the rider will be at state (A. 12:00 am). We wish to optimize the expected time of 
arrival at station D. 
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Hu- possible anions at each stale consist of the list of buses we will lake if such a bus 
arrives at. the station over the next, minute from that. lime. We assume in this discretized 
version that, buses may arrive in the same interval, so that, our possible action at each state 
is a sorted list o( buses that we will take if a bus arrives, with the sorted order giving a 
preference it two buses arrive at the same time. 

In our simple example, let E(s.t.) be the expected time to reach D from state s at t ime 
I. Note that EiB.t) and E(C.t) are trivial to compute. Let c = 1 - exp(-O.l) be the 
probability that a bus headed for B (or equivalently for C) arrives at A during a minute. 
We have the recurrence 

E{.\.t) = (1 - z)-E(.\.t+ 1) + 

c(l - ;) niin(£(A. t + 1). £"(B. t. + G)) + 

~ ( 1 — ') min(£(A. t + 1). E{C.t + 11)) + 

~ 2 min(E(A. t + l). E(B.t + 6). E(C.t + 11)). 

We solve this recurrence (or decreasing t. From the recurrence we can naturally derive the 
correct actions: for example, if E{ A, t + 1) < £(B, t + 6) then we will not take the bus to 
B. The results for our example are given in Table 1. 


Time 

0-2 

3-4 

5-21 

22-24 

25-30 

Action 

. 

C 

C.B 

B 

B.C 

re 


Table 1. At more than a few hours from the end of the day. the optimal strategy has a half 
hour cycle. At states where t is 0 to 2 minutes over the half hour, plan onlv to take bus C 
if it arrives over the next minute. 

Theorem 3 llu. travel plan for optimizing the expected travel time for networks with buses 
and trains, where buses have discrete geometric arrival distributions, can be computed in 
time polynomial in the number of stops, the maximum number of buses and trains at a 
stop, and the total number of time units simulated. This holds even if bus travel times are 
random variables that depend on the time of arrival to the station. 

Proof: We provide a more general framework and corresponding bounds on the time to 

compute the optimal strategy. We use dynamic programming, computing the E(s.t) in 
reverse temporal order; that is, we start at the end of the process, and compute E(s.t) 
for all s (in any order) using already computed values E(s. u) with u > t. Suppose that a 
maximum of Q buses or trains pass through any of S total stops, and our process lasts for 
T units of time. For each time state (.s,t), there are at most £,= 0 (?)i! possible actions, 
as each ordered subset of the buses and trains are a possible action. However, following 
the idea of Theorem 1 we can simplify considerably by sorting the buses and trains by the 
expected time to reach the destination if we choose that option at that time. Every action 
that is better than waiting at the current stop (i.e. better than E{s. t -f- 1)) is one that will 
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In' taken, and I lie sorted order provides I lie prelerenee. Note that we can soil the transport 
opt ions, hut t he results may be different. Cor different time steps. 

In our example, vve have that, the Inis travel times are constant. In this ease, once we 
have the sorted order, computing E(s.t) can lie done in time (){Q) by considering the 
arrival possibilities in sorted order. For convenience suppose there are only buses at the 
station (trains are easy to handle, as they are either ready to leave or not). II the first bus 
arrives, vve take it: if not. but the second bus arrives, we take that: and so on. There are 
only O(Q) possibilities to consider. Hence the total time to compute optimal schedules in 
this case is G(STQ logQ). 

If instead bus travel times are given by a fixed discrete random distribution, or even 
a discrete distribution that varies over time, this only increases the work to compute the 
expected times to reach the destination by a factor of ()(T). for total work 0 ( S TQ log Q 4- 
SQT 2 ). 

(Note: in the case where bus travel times are given by fixed discrete random variables, 
standard convolution techniques may reduce the total work to (){STQ log Q 4- SQT log T): 
however, it appears some additional assumptions are necessary for these methods to apply. 
We will explain further in the final version of the paper.) □ 

Although the complexity of these solutions may be large when computing over long time 
intervals, they appear feasible for reasonable-sized systems. We also note that another 
advantage of this setup is that we can handle value functions more general than the expected 
travel time: for example, we could use the same approach to maximize the probability of 
reaching our destination by a certain time. 

To summarize, this framework improves over previous work in the following respects. 
In comparison to the work of [3]. we show that handling buses with Poisson arrivals and 
trains is possible: moreover, we show that the simple form of the optimal schedule we have 
shown in Section 2 P not possible in this setting. In eomparisc '"it h previous work on 
Markov decision processes such as [2. 9]. we have shown how to handle the problem of 
waiting for multiple buses at a station in the case of a geometric arrival process, which 
leads to a relatively simple state space. 


4 Conclusions and Future Work 

We have expanded previous work on stochastic transportation networks in two ways. First, 
we provided an algorithm for finding optimal schedules for bus networks where bus arrival 
distributions have increasing failure rate. Second, we have given an algorithm for finding 
optimal schedules in mixed networks of buses and trains when the bus arrival distributions 
are discrete geometric random variables. We plan to implement these algorithms and test 
them on artificially generated and real data in the near future. 

There remain many open questions to pursue: we suggest two here. First, fast approx- 
imation algorithms would be useful, especially tor transportation networks that change 
often. Moreover, approximation algorithms may allow more general distribution classes to 
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Ih* hainllnl. Stroud. w<* might, consider the sit.ual ion when die transportation network inav 
provide addil ional information. Fur example. buses equipped with global posit inning equip- 
iiHMit, and wireless communication may lx* able to provide* their position. In this situation, 
a ruler determining whether or not to get on a bus may have morn detailed information 
availablf* about the waiting time for other bust's. 
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